On the approximability of the exemplar adjacency number problem for genomes with gene repetitions

نویسندگان

  • Zhixiang Chen
  • Bin Fu
  • Randy Goebel
  • Guohui Lin
  • Weitian Tong
  • Jinhui Xu
  • Boting Yang
  • Zhiyu Zhao
  • Binhai Zhu
چکیده

In this paper, we apply a measure, exemplar adjacency number, which complements and extends the well-studied breakpoint distance between two permutations, to measure the similarity between two genomes (or in general, between any two sequences drawn from the same alphabet). For two genomes G andH drawn from the same set of n gene families and containing gene repetitions, we consider the corresponding Exemplar Adjacency Number problem (EAN), in which we delete duplicated genes from G and H such that the resultant exemplar genomes (permutations) G and H have the maximum adjacency number. We obtain the following results. First, we prove that the one-sided 2-repetitive EAN problem, i.e., when one of G and H is given exemplar and each gene occurs in the other genome at most twice, can be ∗Corresponding Author. Email addresses: [email protected] (Zhixiang Chen), [email protected] (Bin Fu), [email protected] (Randy Goebel), [email protected] (Guohui Lin), [email protected] (Weitian Tong), [email protected] (Jinhui Xu), [email protected] (Boting Yang), [email protected] (Zhiyu Zhao), [email protected] (Binhai Zhu) Preprint submitted to Theoretical Computer Science July 7, 2014 linearly reduced from the Maximum Independent Set problem. This implies that EAN does not admit any O(n)-approximation algorithm, for any ǫ > 0, unless P = NP. This hardness result also implies that EAN, parameterized by the optimal solution value, is W[1]-hard. Secondly, we show that the two-sided 2-repetitive EAN problem has an O(n)-approximation algorithm, which is tight up to a constant factor.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximability and Fixed-Parameter Tractability for the Exemplar Genomic Distance Problems

In this paper, we present a survey of the approximability and fixed-parameter tractability results for some Exemplar Genomic Distance problems. We mainly focus on three problems: the exemplar breakpoint distance problem and its complement (i.e., the exemplar non-breaking similarity or the exemplar adjacency number problem), and the maximal strip recovery (MSR) problem. The following results hol...

متن کامل

Non-breaking Similarity of Genomes with Gene Repetitions

In this paper we define a new similarity measure, the nonbreaking similarity, which is the complement of the famous breakpoint distance between genomes (in general, between any two sequences drawn from the same alphabet). When the two input genomes G and H, drawn from the same set of n gene families, contain gene repetitions, we consider the corresponding Exemplar Non-breaking Similarity proble...

متن کامل

A Pseudo-boolean Programming Approach for Computing the Breakpoint Distance Between Two Genomes with Duplicate Genes

Comparing genomes of different species has become a crucial problem in comparative genomics. Recent research have resulted in different genomic distance definitions: number of breakpoints, number of common intervals, number of conserved intervals, Maximum Adjacency Disruption number (MAD), etc. Classical methods (usually based on permutations of gene order) for computing genomic distances betwe...

متن کامل

On the Approximability of Comparing Genomes with Duplicates

A central problem in comparative genomics consists in computing a (dis-)similarity measure between two genomes, e.g. in order to construct a phylogenetic tree. A large number of such measures has been proposed in the recent past: number of reversals, number of breakpoints, number of common or conserved intervals, SAD etc. In their initial definitions, all these measures suppose that genomes con...

متن کامل

The Zero Exemplar Distance Problem

Given two genomes with duplicate genes, Zero Exemplar Distance is the problem of deciding whether the two genomes can be reduced to the same genome without duplicate genes by deleting all but one copy of each gene in each genome. Blin, Fertin, Sikora, and Vialette recently proved that Zero Exemplar Distance for monochromosomal genomes is NP-hard even if each gene appears at most two times in ea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Theor. Comput. Sci.

دوره 550  شماره 

صفحات  -

تاریخ انتشار 2014